Class visualization of high-dimensional data with applications
نویسندگان
چکیده
Consider the problem of visualizing high-dimensional data that has been categorized into various classes. Our goal in visualizing is to quickly absorb inter-class and intra-class relationships. Towards this end, class-preserving projections of the multidimensional data onto twodimensional planes, which can be displayed on a computer screen, are introduced. These class-preserving projections maintain the high-dimensional class structure, and are closely related to Fisher’s linear discriminants. By displaying sequences of such two-dimensional projections and by moving continuously from one projection to the next, an illusion of smooth motion through a multidimensional display can be created. We call such sequences class tours. Furthermore, we overlay class-similarity graphs on our two-dimensional projections to capture the distance relationships in the original high-dimensional space. We illustrate the above visualization tools on the classical Iris plant data, the ISOLET spoken letter data, and the PENDIGITS on-line handwriting data set. We show how our visual examination of the data can uncover latent class relationships.
منابع مشابه
An Improved Lvq Algorithm with Data-structure Preserving Visualization
Data-structure preserved visualization of high-dimensional data reveals the dataset borders and the spread and overlapping tendency of the class borders in a more informative manner than the usual data-topology preserved mapping produced by SelfOrganizing Maps (SOMs). Hence, an extension of SOM called Probabilistic Regularized SOM (PRSOM) is proposed for the data-structure preservation in the v...
متن کاملSolving a class of nonlinear two-dimensional Volterra integral equations by using two-dimensional triangular orthogonal functions
In this paper, the two-dimensional triangular orthogonal functions (2D-TFs) are applied for solving a class of nonlinear two-dimensional Volterra integral equations. 2D-TFs method transforms these integral equations into a system of linear algebraic equations. The high accuracy of this method is verified through a numerical example and comparison of the results with the other numerical methods.
متن کاملHow Porous Nanofibers Have Enhanced the Engineering of Advanced Materials: A Review
Nanofibers are one-dimensional nanomaterialswith a superfine diameter and many potential applicationsdue to their desirable characteristics such as small diameter,high surface area, high flexibility, high porosity, and specialmechanical properties. In the recent years, porous nanofibershave been the subject of considerable research works in awide range of app...
متن کاملA new approach for data visualization problem
Data visualization is the process of transforming data, information, and knowledge into visual form, making use of humans’ natural visual capabilities which reveals relationships in data sets that are not evident from the raw data, by using mathematical techniques to reduce the number of dimensions in the data set while preserving the relevant inherent properties. In this paper, we formulated d...
متن کاملDoubly supervised embedding based on class labels and intrinsic clusters for high-dimensional data visualization
Visualization of data can assist decision-making processes by presenting the underlying information in a perceptible manner. Many dimension reduction techniques have been proposed to generate faithful visualization snapshots given high-dimensional data. When class labels associated with the data are already provided, supervised dimension reduction methods, which utilize such pre-given label inf...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computational Statistics & Data Analysis
دوره 41 شماره
صفحات -
تاریخ انتشار 2002